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Abstract. We show that all smooth solutions of model non- linear 
sums of squares of vector fields are locally real analytic. 



1. Introduction 

We consider sums of squares of non linear vector fields, that is equa- 
tions such as 

q 

P(x,u,D)u = Y,Xfu = f 
1 

with the new feature that {Xj} may depend in their "coefficients" 
on the solution u. As a prime example of this class we consider the 
following case (for r > 0): 

(1.1) P u (D)v := ((D x ) 2 + (x r D t ) 2 + (x r h(x,t,u)D t ) 2 ^) v, (x,t) E R 2 

with h real valued and real analytic in its arguments. 

We shall assume our solution u to be C°°, since smoothness (starting 
from C 2+a ) follows from the arguments of Xu [7j which are based on 
the subelliptic estimate clearly satisfied by P u and the paradifferential 
calculus of Bony pQ . 

2. Results 

Theorem 1. If f is real analytic near (xq, to), then so is any smooth 
solution to < |j.ip . 

We remark that the problem is significant in its own right and also 
because it bears the same resemblance to general quasilinear subelliptic 
partial differential equations that the sums of squares of linear vector 
fields do to the subelliptic complexes and 'boundary Laplacians' arising 
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from the df, operator in several complex variables. In particular, the 
local real analytic hypoellipticity of those (in the linear case) with 
symplectic characteristic variety (roughly corresponding to r = 1 here), 
proved independently by Treves and Tartakoff in 1978 (0j, [5], [§]), 
propels one quite reasonably to ask the same question in the quasilinear 
setting, of which the type of operator under study here is a simple 
prototype. (NB - the vector fields arising from eta) correspond more 
directly to d t — yd t and d y + xd t than to d t ,d y , xd t and yd t as separate 
vector fields; nonetheless the "Grushin-type" operators have always 
provided the most tractable models.) 

3. Proof 

Using standard arguments it is easy to prove the following a priori 
estimates: Vs > 0, u G C°° and compact U, 3C = C s ^u :VuG C£°(U), 



(3.1) Yl W X M\l + IMI^ < C{\(P u v,v) s \ + \\vf s } and 

i 

3 3 

(3.2) \\XiXM\*+J2\\ x ^+^+\\ v \\ 2 s+^ < c {\\ p - v W 2 s+\M 2 s } 

i,j = l 1 

where || • || s = || ■ ||^, P u = P{x,u,D), X 1 = D t , X 2 = x r D t , X 3 = 
, x, u)D t , and C depends only on the first s + 3 derivatives 

of u. 

However the estimate we will need uses the maximality (and ar- 
bitrary positivity) of ()3.2j) rather than its subellipticity: with ||H|| S 
defined as follows for s a positive integer, 
(3.3) 

IIMII* = H-^IU (IIMII# S (W) = W xIv \\h s (u)^oi U open) 

|J|<2 |Z|<2 

then for any K and with X 1 = X h X h ...X 1 ^, 3C K : G C™(U), 

(3.4) HMH2 + K \\ xIv h < C\\P u vh + C K \\v\\ . 

[J[<i 

4. The general scheme 

The general scheme, as always, will be to use the a priori estimate 
applied to functions v = (pD m u and then to bring ipD m to the left of P u 
modulo errors which are handled inductively. Noting that the a priori 
estimate provides for maximal control (i.e. no loss of derivatives) in 
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the D x direction, we limit ourselves to estimating ipD^u. (f will be a 
smooth localizing function, namely identically equal to one in a fixed 
open set Uq, where we wish to prove the solution u is analytic, and 
supported in U%, the open set where the data are assumed to be real 
analytic. The localizing function ip(x, t) may be taken to be of the form 
<p(t)tp(x), and terms with derivatives on (p(x) may be disregarded since 
the operator is elliptic when x is away from 0, namely in the support 
of derivatives of <p(x). . Thus for our purposes, ip — <p(t) alone. 
Taking Pu = without loss of generality, we have from (|3.4|) : 



III^IMIIa WXfrDTuh + < \\P V D?u\\ 2 | < J2\\[Xj,ipD™]u\ 



ml ml ml ml 

< y \\g k (x,t,u,u')^x 2r Dr +2 - k u\\ 2 | c \\<px 2r [h{u),D?u]D*u\\2 , 
~ ^ ml ml 

k=l 

(41) ^ c y h {k) X 2 Dr k uh , c \\^[h{u),D?u]Dtuh | ^ 
~ ' ml ml 

k=l 

considering x 2r D 2 = X 2 , writing h(-) = h 2 (-) and estimating the norm 
||<7A;(x, t, u, u')\\h 2 (U!) by a constant. Here the gk(x, t, u, v!) stand for the 
coefficients, aside from x 2r , which enter when ip is differentiated once 
or twice, and the dots denote terms arising from lower order terms 
in the operator P, terms containing fewer X's. 

We focus on the bracket in the last norm, the crucial one. To expand 
(D™ h)(u(x, t)),we will need to use the Faa di Bruno formula or rather, 
what will suffice, and probably be more transparent, crude bounds for 
the results: writing 

D?g(u(x,t)) = {u'D u + Ar-VA^MM)); 

with primes on u denoting t derivatives, writing this roughly as 

D?g{u{x,t)) = {{u'a + D t r- l u\ a=Dg )g, 

i.e., a becomes a 'countem' for the number of derivatives received by 
g. Then this is at worst 



(4.2) [^)9 {m ' m '\Dfu ,m - m ). 

Finally, distributing a objects into (3 positions yields 

T^a„./b „./( Q i) „.l( a b) 
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Thus we have 



[D?,P] = y. (Dfh(u(x,t))) {x*D*)D 



m—m 
t 



to! ^— ' m'! (m — m')! 

m'=l 



with (cf. g2D): 



{Df)h(u(x,t)) ^ h (m'-m") (fl™" 

m'\ ' (to/ — to")! to"! 

m'-m">l K ' 



m' —m" f 



y — v 



or in all, with (gUJ), 

(4.4) HlyA m "lll2 < \\W\PM\ | <y- l|y (fc) x 2 A m '^i| 2 | ^ 



m TO! 14 — ' TO! 

fc=l 



h (m'-m") D rn lu , ^ V^ u » (^nJ^,. ...^ 

V 3 



2 



(to' — to")! to"! ^m'-m"' (m — m')! 
where the sum is over m > m' — to" > 1 , YlJLi™ m j = m " an d so 

E^r"K +!)="»'■ 

By associating x 2r with a different term in the product if necessary, 
we may assume that the last term is of greatest order, and hence that 
the others are of order at most to/2. 

5. Remarks on the last sum 

Several remarks are in order concerning the last right hand side. 

First of all, in utilizing the property that if 2 is an algebra to take the 
product of norms, there will occur a constant raised to the to'— to". But 
this is allowable, since there are m! — to" derivatives on the (analytic) 
function h where we expect a constant to that power. 

Secondly, that power that power always corresponds to the increase 

in number of terms of the form D t 3 u'/m"\ inside the norm; in the end 
the number of these terms cannot exceed to, hence the constant cannot 
exceed C m . 

Thirdly, we will associate the localizing function ip, with the highest 
order term and take it out of the norm, introducing another one which 
is closely related to the number of derivatives in that term - in this 
case m — to'. In bringing ip out of the norm there may be one or two 
derivatives (or three or four, given the first terms on the right of (|4.4Jl ). 
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and while they will presumably balance quite well with ml we need to 
be sure that they balance as well with (m — m')l when m — m' may 
actually be rather small (a large drop may have occurred all at once). 
To this end we make the following observation: as m drops from m 
to m — m', there have appeared m' — m" new lower order terms, or 
m' — m" + 1 terms of no greater order, counting the principal one. 
Thus we have 

j ji 

(5.1) (m — m" + l)(m — m) > m\ i.e., < m! — m" + 1, 

m — m' 

the same factor that occurred before, and appears in the number of 
derivatives on h. Thus, again, we can afford (m/m — m') 4 without 
danger. 

The fourth observation concerns the effect of the sum. The sum 
corresponds at most to the number of ways to partition m derivatives 
among at most m functions, generally many fewer. Denoting by D a 
derivative (m of them) and by u a copy of u (t of them) we are faced 
with the number of ways to 'identify' or select t items (the u's) from 
among m + t items (the D's and w's) with the understanding that in 
an expression such as 

(5.2) DDDDD u DDDDD u DDDDD u DDDDD u--- DDDDD u 



m D's and t(<m) u's 

the D's differentiate only the first u following. The answer is that there 
are certainly not more than ( m ^*) < 2 m+t < 2 2m = 4 m ways. And while 
we have written this out only for the first complete iteration of the a 
priori estimate, it is a remarkable fact that the form of the sum does not 
change after multiple passes, and hence the number of terms involved 
is subject to the same bounds. What is more, the same analysis applies 
after iteration of ()4.4|) (cf. below) and thus the sum will also not pose 
a difficulty in proving analyticity and may be replaced by a supremum 
below. 

Finally, when these considerations enter and readability is an issue 
due to the length of lines, we shall tacitly replace the sum by a supre- 
mum and omit a constant such as C m ~ m +2 . 



6. The localizing functions and introducing new ones 

Proposition 1. For any two open sets Qq (e Qi, with separation 
d = dist.(Qo, flf) and any natural number N, there exists a univer- 
sal constant C depending only on the dimension and a function \& = 
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(6.1) \D^\<1^\ N m , \(3\<2N, 

The first localizing function, (p = \I/ m , satisfies: 

(6.2) * m ElonW 0l f m e C °°(W 1/m ) , < c k m k , k < 4, 
where we have set, for a > : 

(6.3) W a = {(x,t) G U x : dist((a;,t),Wo) < a(dist(W , U{)} . 

When the first localizing function needs to be replaced but, say, 
rh derivatives of u remain to be estimated, we shall localize it with 
a function identically equal to one on Ui/ m , the support of \I/ m but 
dropping to zero in a band of width 1/rh of the remaining distance 
(a(l — 1/m)) to the complement of U±, i.e., supported in 

(6.4) = = Wj.n.ijn.u. 

m v m yv 771 ' m mm x m ' v m ' 

We shall denote such a function by That is, P ^ CT satisfies: 

m 

(6.5) = 1 on U p , P ^ CT G C^(U p+ i {1 _ p) m Wi). 
Derivatives of p \l/ CT satisfy, with universal constant C: 
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(6.6) |^(A)|<C^-j , k<4. 

uniformly in p, a. Of course any other (fixed) bound for k would do. 

While it is true that we could just write ||ym>|| s < c||<p|| s ||u>|| s , for 
s > 2, to do so would incur at least two derivatives on (p with no gain 
on w. To avoid this difficulty, we use the following finer estimates of 
the if 2 norm of product of functions. 

Proposition 2. If(p,<p are two smooth, compactly supported functions 
with <f = 1 on supp if then for every p > 2 

(6.7) \WD p u\\ 2 < C 2 sup \\D q y\\ L oo\\CpD p - q u\\ 2 , and 

q<2 



(6.8) yD^uh < C 2 sup II^VlUocll^-^H^^ v) 

q<2 
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7. Expanding the norm of the product in f)4.4j) 

The norm of the product in ()4.4j) will be replaced, as announced, by 
the product of the H 2 norms, most of which will have as new functions 
i/m^m'j '■ multiplying through by m, 



(7.1) 



\ V m D?u\\\ 2 
(m - 1)! 



k=l 



(m- 1)! 



+ sup 

m>m' — m ff > 1 

\ -> m 1 —m ,f a it 

> , m!—m" 

/s-^m* —m 11 , it , 1 , /\ 
OZj =1 (m , j + l) = m') 




C\\ l/m m m »D t 3 u'\\ 2 \ \L^ m X 2 D 



U \\2 



m"-\ 



(m — m' — 1)! 



where, using (j5.1|) . the factor m/m—m' which entered on the right from 
multiplying through by m and decreasing the last denominator by one 
is absorbed in a slightly larger constant C m ~ m in the product. We 
have also bounded the terms \\h^(x,t,u)/r\\\ by C r and distributed 
these constants, one per term in the product of norms of derivatives of 
u'. 

To unify these two types of terms we could combine them into one 
sum, over k + m' > 1, but there is nothing new introduced by consider- 
ing the couple of extra derivatives which the localizing functions may 
receive - there is compensation with decrease in m and we have already 
seen this effect - it is essentially one familiar in elliptic regularity proofs 
by L 2 methods, so we will omit the terms with k > 0. 

Now we have seen that we may bring the last localizing function, 
o^m\ out of the last norm and introduce the next function, i/ m \l/ m _ m ', 
identically equal to one on the support of o^m, with a larger constant 
Ch- According to the above Proposition, when bringing a localizing 
function out of the norm its L°° norm will contribute up to two or, 
if already differentiated, perhaps four factors of m with corresponding 
decrease in the number of derivatives on u. This disturbs the balance 
between number of derivatives and the factorial, but (|5.1I ) shows that 
even factors of roughly (m/m—m) 4 merely serve to modify the constant 
Ch', we conclude that we may pass from one localizing function to the 
next without problems. 

That is, applying (|7.1|) to its own last term, with m replaced by 
m — m', and ignoring k > for simplicity, we have, denoting by — = 

1 — (1 — -M(l — ^ , ) the band used up by the supports of the first two 
localizing functions, which will depend on the choice of m', and once 
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again ignoring the first term on the right, 

mi ,t, 7->m— m' 1 1 1 

\l/m^m-m'jJ t U\\\2 



(7.2) 1 

+ sup 
or together, 



< 



r 

■'-jT C\\ 1/m2 * p ,jDS'h \ \\yA-m' X*Dr m '- pl u\\ 2 

3=1 



p''\ J (m — m' — p' — 1)! 




! |i !•■/,;-•- •/ "III- ^ III ,|l/ m m "A U l|2 , 

( 7 " 3 ) —JZ. ~ SU P din X 



m r 



I iTr v2 T\m—m —p n 

(m — m' — p' — 1)! 

where the supremum is over both sets of indices: m > m' + p' — m" — p" 

and X]j=i" m Sfc=i P ( m i + Pk) = m " + P" so ^ we se ^ s' = m! + p' 
and s" = m" + p", we have a sum over all m > s' — s" > 2 and 
Z^ S j+k=2 s j+k = s " while after the first iteration the sum was over all 

indices such that m > m' — m" > 1, Y^i™ m j = m " ■ ^ n both cases, 

and for all succeeding ones, the number of such possibilities was seen 

by O to be bounded by C m . 

We continue this process, pulling the localizing function i/ m ^ m _ m ' 

out of the last norm and replacing it with i/m 2 ^m-m'- P ', subjecting 

that term to the a priori estimate, etc. Each time there is a whole 

'spray' of far lower order terms, but the number of these is s' — s", each 

has a suitable localizing function which will let us pass to a subsequent 

one by placing one (universal) constant with each new copy of u', and 

in the end we have a product of on the order of m terms of the form 

||-D r w||_H-2( Wl ) all of order r < 4, say. (After all, localizing functions need 

not be introduced at the last stages - or even in any of the above, until 

we need to estimate a given term carefully - for instance, in the product 

// 

\\D t J u'\\ h 2(jj j 

in (|7-H) the terms could easily have been left as - f — ^-,) at least 

until the time came to subject that term to the a priori estimate to 
reduce its order (in case all other terms had been reduced to lower 
order). 

We also need to remark at the end that what was true for the first 
localizing function, namely (j5.1|) . will be a little different, since the 
next localizing function may bring not a factor of m — m! with each 
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derivative it receives but rather the factor (cf. (jfi.fijT 



m — m! , ,. I m 



(m — m!) 



1 — - \ m — 1 

so that, passing from m — m! to m — m' — n' we encounter instead of 
just 

m 



< m' — m" + 1 
m — m' 

an extra factor of m/m — 1, possibly to the fourth power; and this may 
keep occurring as the order of the leading term keeps decreasing. For 
instance, after a few iterations, the analogous 'extra' factors from 
will be 

m \ I m — rrii \ ( m — rrii — m 2 



m — 1 J \m — rrii — 1/ \ m — m i — m 2 ~ 1 

or even the fourth power of such a product. But there cannot be more 
than m terms in the product and each factor is far less than 2, leading 
to an easily acceptable constant C m in the end. 

This will prove the bounds for the laft hand side of (J7.1|) 

IM^tMh < C m + 1 



(m - 1)! 

uniformly in m and hence the analyticity of u in Uq. □ 
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